CLAIMS 



What is claimed is: 







1. A method for enhancing voice activity detection comprising: 
determining a peak-to-mean likelihood ratio; and
comparing the peak-to-mean likelihood ratio to a selected threshold to
determine whether a current audio frame represents a voice signal.



2. The method of claim 1, wherein prior to determining the peak- 
to-mean likelihood ratio, the method further comprises: 

determining a short-term averaged energy for the current audio frame;
and
determining a long-term averaged energy for the current audio frame.



3. The method of claim 2, wherein after determining the short-term
averaged energy and the long-term averaged energy, the method further
comprises:
determining whether a sum of the short-term averaged energy and a
predetermined factor is greater than the long-term averaged energy; and
determining that the current audio frame represents silence if the sum
is less than the long-term averaged energy, without necessitating a
determination of the peak-to-mean likelihood ratio.
4. The method of claim 3, upon determining that the sum is
greater than the long-term averaged energy and before determining the
peak-to-mean likelihood ratio, the method further comprises:
determining whether a difference between the long-term averaged
energy and the short-term averaged energy is less than a predetermined
threshold;
determining that the current audio frame represents voice if the
difference is greater than the predetermined threshold; and
continuing by determining the peak-to-mean likelihood ratio if the
difference is less than the predetermined threshold.




5. The method of claim 2, wherein the determining of the short-term
averaged energy comprises:
determining an energy, in decibels, of the current audio frame;
determining a short-term averaged energy for a prior audio frame; and
conducting a weighted average of the energy of the current audio frame
and the short-term averaged energy for the prior audio frame.
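
A minimal sketch of the weighted average recited in claim 5 might look as follows. The claim does not specify how the frame energy is computed or what weight is used, so the mean-square energy definition, the `floor` guard, and the weight `alpha` are all assumptions chosen for illustration.

```python
import math


def frame_energy_db(samples, floor=1e-12):
    """Energy of one audio frame in decibels.

    Assumption: energy is the mean squared sample value; `floor` is a
    hypothetical guard that avoids log10(0) on an all-zero frame.
    """
    mean_square = sum(s * s for s in samples) / len(samples)
    return 10.0 * math.log10(max(mean_square, floor))


def update_short_term_energy(prev_avg_db, frame_db, alpha=0.25):
    """Weighted average of the current frame energy and the prior
    short-term averaged energy. `alpha` is a hypothetical weight."""
    return alpha * frame_db + (1.0 - alpha) * prev_avg_db
```

Under these assumptions, a larger `alpha` makes the average follow the current frame more closely, while a smaller `alpha` gives a slower-moving average.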

6. The method of claim 1, wherein the determining a peak-to-mean
likelihood ratio comprises:
calculating an averaged peak-to-mean ratio for the current audio
frame;
determining a maximum averaged peak-to-mean ratio;
determining a minimum averaged peak-to-mean ratio;
determining a first result being a difference between the maximum
averaged peak-to-mean ratio and the averaged peak-to-mean ratio for the
current audio frame;
determining a second result being a difference between the maximum
averaged peak-to-mean ratio and the minimum averaged peak-to-mean ratio;
and
conducting a ratio between the first result and the second result to
produce the peak-to-mean likelihood ratio.
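
The steps of claim 6 can be sketched as below. The claim recites "a ratio between the first result and the second result" without fixing which is the numerator, so dividing the first result by the second is an assumption here, as is the zero guard used when the maximum and minimum coincide.

```python
def peak_to_mean_likelihood_ratio(pmr_avg, pmr_max, pmr_min):
    """Peak-to-mean likelihood ratio from averaged peak-to-mean ratios.

    pmr_avg: averaged peak-to-mean ratio for the current audio frame
    pmr_max: maximum averaged peak-to-mean ratio observed so far
    pmr_min: minimum averaged peak-to-mean ratio observed so far
    """
    first_result = pmr_max - pmr_avg   # distance of current frame from the maximum
    second_result = pmr_max - pmr_min  # full observed range
    if second_result <= 0.0:
        # Assumption: with no observed range yet, return 0.0 rather than divide by zero.
        return 0.0
    return first_result / second_result
```

With this orientation the result is 0 when the current frame sits at the maximum and 1 when it sits at the minimum.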

7. A communication module comprising:
a substrate;
a processing unit placed on the substrate; and
a memory coupled to the processing unit, the memory to contain a
voice activity detector which, when executed by the processing unit,
analyzes a short-term averaged energy, a long-term averaged energy, and a
peak-to-mean likelihood ratio in order to determine whether a current
audio frame represents voice or silence.



8. The communication module of claim 7, wherein the voice
activity detector, when executed, controls the processing unit to determine
whether a sum of the short-term averaged energy and a predetermined factor
is greater than the long-term averaged energy, and to signal that the
current audio frame represents silence if the sum is less than the
long-term averaged energy.



9. The communication module of claim 8, wherein the voice
activity detector, when executed, controls the processing unit to determine
whether a difference between the long-term averaged energy and the
short-term averaged energy is less than a predetermined threshold, and to
signal that the current audio frame represents voice if the difference is
greater than the predetermined threshold.

10. The communication module of claim 9, wherein the voice
activity detector, when executed, controls the processing unit to determine
the peak-to-mean likelihood ratio and to compare the peak-to-mean
likelihood ratio to a selected threshold to determine whether a current
audio frame represents a voice signal.



11. The communication module of claim 10, wherein the voice
activity detector, when executed, controls the processing unit to determine
a peak-to-mean ratio by (i) sampling an analog signal a predetermined
number of times to produce a plurality of sampled signals each having a
sampled value, (ii) determining a maximum value of the plurality of sampled
signals, and (iii) conducting a ratio between an absolute value of the
maximum value and a summation of the sampled values for the plurality of
sampled signals.
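
As a rough illustration of steps (ii) and (iii) of claim 11, the sketch below computes a peak-to-mean ratio from already-sampled values. The claim recites a summation of the sampled values; this sketch sums their absolute values instead, an assumption made so that the denominator does not cancel toward zero for audio that swings around zero. The zero guard is likewise hypothetical.

```python
def peak_to_mean_ratio(samples):
    """Ratio of the absolute value of the maximum sample to a summation
    over all samples in the frame.

    Assumption: the summation is taken over absolute sample values.
    """
    peak = abs(max(samples))               # absolute value of the maximum value
    total = sum(abs(s) for s in samples)   # summation (absolute values assumed)
    if total == 0.0:
        # Assumption: an all-zero frame yields a ratio of 0.0.
        return 0.0
    return peak / total
```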

12. The communication module of claim [illegible], wherein the voice
activity detector, when executed, controls the processing unit to determine
an averaged peak-to-mean ratio for the current audio frame by (i)
monitoring a maximum averaged peak-to-mean ratio and a minimum averaged
peak-to-mean ratio, (ii) determining a first result being a difference
between the maximum averaged peak-to-mean ratio and the averaged
peak-to-mean ratio for the current audio frame, (iii) determining a second
result being a difference between the maximum averaged peak-to-mean ratio
and the minimum averaged peak-to-mean ratio, and (iv) conducting a ratio
between the first result and the second result to produce the peak-to-mean
likelihood ratio.

13. A machine readable medium having embodied thereon a
computer program for processing by a machine, the computer program
comprising:
a first routine for determining a peak-to-mean likelihood ratio; and
a second routine for comparing the peak-to-mean likelihood ratio to a
selected threshold to determine whether an audio frame being transmitted
represents a voice signal.



14. The machine readable medium of claim 13, wherein the
computer program further comprises:
a third routine for determining a short-term averaged energy for the
audio frame, the third routine being executed before the first and second
routines; and
a fourth routine for determining a long-term averaged energy for the
audio frame, the fourth routine being executed before the first and second
routines.



15. The machine readable medium of claim 14, wherein the
computer program further comprises:
a fifth routine for determining whether a sum of the short-term
averaged energy and a predetermined factor is greater than the long-term
averaged energy, the fifth routine being executed before the first and
second routines; and
a sixth routine for determining whether a difference between the
long-term averaged energy and the short-term averaged energy is less than a
predetermined threshold, the sixth routine being executed after determining
that the sum is greater than the long-term averaged energy and before
execution of the first and second routines.

16. The machine readable medium of claim 15, wherein the fifth
routine determines that the current audio frame represents silence if the
sum is less than the long-term averaged energy.



17. The machine readable medium of claim 15, wherein the sixth
routine determines that the current audio frame represents voice if the
difference is greater than the predetermined threshold.

18. A voice activity detector comprising:

circuitry to determine a short-term averaged energy for an audio frame;
circuitry to determine a long-term averaged energy for the audio frame;
circuitry to determine whether the short-term averaged energy is
greater than the long-term averaged energy by a predetermined factor;

circuitry to determine whether a difference between the long-term
averaged energy and the short-term averaged energy is less than a 
predetermined threshold when the short-term averaged energy is greater 
than the long-term averaged energy by the predetermined factor; 

circuitry to determine a peak-to-mean likelihood ratio when the 
difference between the long-term averaged energy and the short-term 
averaged energy is less than the predetermined threshold; and 
circuitry to compare the peak-to-mean likelihood ratio to a selected
threshold and to determine that the audio frame represents a voice signal
when the peak-to-mean likelihood ratio is greater than the selected threshold.
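
The staged decision recited across claims 3, 4, and 18 can be sketched as a short cascade in which the peak-to-mean likelihood ratio is computed only when the two energy tests are inconclusive. This is a sketch under stated assumptions, following the claim wording literally: the function and parameter names, the use of a callable to defer the ratio computation, and the handling of exact equality at each comparison are all hypothetical.

```python
from typing import Callable


def classify_frame(
    short_db: float,
    long_db: float,
    compute_pmlr: Callable[[], float],
    factor_db: float,
    diff_threshold_db: float,
    pmlr_threshold: float,
) -> str:
    """Return "voice" or "silence" for one audio frame."""
    # Stage 1: silence if the short-term energy plus the predetermined
    # factor is less than the long-term energy. No ratio is computed.
    if short_db + factor_db < long_db:
        return "silence"

    # Stage 2: voice if the difference (long-term minus short-term) is
    # greater than the predetermined threshold. No ratio is computed.
    if long_db - short_db > diff_threshold_db:
        return "voice"

    # Stage 3: otherwise fall back to the peak-to-mean likelihood ratio.
    # Assumption: a ratio exactly equal to the threshold counts as silence.
    return "voice" if compute_pmlr() > pmlr_threshold else "silence"
```

Passing `compute_pmlr` as a callable keeps the more expensive ratio computation off the path for frames that the energy comparisons already resolve.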